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Abstract 

The present study deals with the analysis and mapping of Swiss franc interest rates. 
Interest rates depend on time and maturity, defining term structure of the interest 
rate curves (IRC). In the present study IRC are considered in a two-dimensional 
feature space - time and maturity. Geostatistical models and machine learning al- 
gorithms (multilayer perceptron and Support Vector Machines) were applied to 
produce interest rate maps. IR maps can be used for the visualisation and patterns 
perception purposes, to develop and to explore economical hypotheses, to produce 
dynamic asses-liability simulations and for the financial risk assessments. The feasi- 
bility of an application of interest rates mapping approach for the IRC forecasting 
is considered as well. 
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1 Introduction 



The present study deals with an empirical analysis and mapping of Swiss franc 
(CHF) interest rates (IR). The complete empirical analysis includes compre- 
hensive quantitative analysis and characterisation of daily behaviour of CHF 
interest rates from October 1998 to December 2005. A mapping part of the 
paper considers the application of spatial interpolation models for IR mapping 
in a feature space "maturity-date" . In a more general setting interest rates can 
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be considered as functional data (IR curves are formed by different maturi- 
ties) having specific internal structures (term structure) and parameterised by 
econometric models. 



The main issues related to IR mapping are the following: 1) empirical analysis 
of IR spatio-temporal patterns, 2) reconstruction and prediction of interest 
rate curves; 3) incorporation of economical/financial hypotheses into the IR 
prediction process; 4) development of what-if scenario for financial engineering 
and risk management. So me of the prelim i nary ideas elaborated in this study 



were firstly presented in (IKanevski et. all 120031 ) 



In gener al there are two principal approaches to make term-structure pre- 
dictions (IDiebold and Canlinl . 120061 ): no-arbitrage models and 2) equilibrium 
models. The no-arbitrage models focuses on fitting the term structure at a 
point in time (one dimensional model depending on maturity) to ensure that 
no arbitrage possibilities exist. This is important for pricing derivatives. The 
equilibrium models focuses on modelling the dynamics of the intravenous rate 
using afline models after which rates at other maturities can be derived un- 
der various assumptions about risk premi um. Detailed discuss i on al ong with 
corresponding references can be found in (IDiebold and Canlinl . 120061 ) . 



An important and interesting approach complementa ry to classical empirical 



analysis of interest rates time series was developed in (IDi Matteo et. all 12005 



Di Matteo and Astel . 20021 ; Cajueiro and Tabak . 2007 ) where both traditional 
econophysics studies (power law distributions, etc.) and a coherent hierarchi- 
cal structure of interest rates were considered in detail. An empirical quanti- 
tative analysis of multivariate interest rates time series and their increments 
(carried out but not presented in this paper) includes study of autocorrela- 
tions, cross-correlations, de trending fluctuation analysis, embedding, analysis 



of distribution of tails, e tc. (IKanevski et. all 12 003 



Mantegna and Stanley! . Il999l ; iKantz and Schreiberj . 120031 ) 



Caiueiro and Tabaki . 12007 



The most important part of the current study deals with an IR mapping 
in a two dimensional feature space (maturity(months), time(date/days)} us- 
ing spatial interpolation/extrapolation models (inverse distance weighting - 
IDW, geostatistical kriging models), nonlinear artificial neural network models 
(multilayer perceptron - MLP) and robust approaches based on rece nt devel- 
opme nts in Statistical Learning Theory (Support Vector Regression) (jVapnikl . 
19991 ). Embedding of IR data into a two-dimensional space brings us to the 
application of spatial statistics and its modelling tools. Higher dimensional 
feature spaces can be considered and applied as well. Simple models, like lin- 
ear and inverse distance weighting were used mainly for the comparison and 
visualisation purposes. 



Evolution of CHF interest rates data is given in Figure 1 where temporal be- 
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Fig. 1. Left: Evolution of IR time series for different maturities: 1,6,12 months and 
5 and 10 years. Right: Examples of the observed CHF interest rates curves for 
different days. 



haviour of different maturities is presented. The IRCs are composed of LIBOR 
interest rates (maturities up to 1 year) and of swap interest rates (maturities 
from one year to 10 years). Such information is available on the specialised 
terminals like Reuters, Bloomberg, etc. and is usually provided for some fixed 
time intervals (daily, weekly, monthly) and for some definite maturities (in 
this research we use the following maturities: 1 week, 1, 2, 3, 6 and 9 months; 
1, 2, 3, 4, 5, 7 and 10 years). 



There are some imp ortant stylized facts that have to be considered when 
modelling IR curves (IDiebold and Canlinl . l2006l ): the average yield curve is in- 
creasing and concave; the yield curve assumes a variety of shapes through time, 
including upward sloping, downward sloping, humped, and inverted humped; 
IR dynamics is persistent, and spread dynamics is much less persistent; the 
short end of curve is more volatile than the long end; long rates are more 
persistent than short rates. There is a coherence in evolution of interest rate 
curves (see some typical examples in Figure 1). 



In the present research the application of IR mapping is concentrated on: 1) 
visualisation and perception of complex data: IR data represented as maps are 
easier for the analysis and interpretation, 2) reconstruction of interest rates 
at any time and for any maturities by using spatial prediction models and 3) 
prediction of IR curves. The last problem can be considered from two different 
points of view: a) predictions without any a priori information (problem of 
extrapolation using historical IR data) and b) forecasting under some prior 
hypotheses about the future evolution of market or interest rates (some banks 
provide information about future level of interest rates at several maturities). 
In the latter case one can consider different scenario "what-if" and prepare 
information on IR curves for d ynamic simulations of portfolio development, 
asset-liability management etc. (IKanevski et. al.L 120031 ) . 
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2 Data description and interest rates mapping 



Daily CHF IR data from October 1998 to December 2005 were prepared as 
a training data set (data used to develop a model). The study of this period 
is quite interesting because during this time different market states (bullish, 
bearish) and even historically the lowest LIBOR rates were observed. Training 
data were used for IR mapping (IRC reconstruction and in-sample predictions) 
and to develop a prediction model which then was used to make predictions 
for January 2006 (out-of-sample predictions). Thus, real data for 2006 were 
used only for validation purposes. 

Time series for several maturities are presented in Figured] (left). It is evi- 
dent that the behaviour of the curves is rather coherent but very complex in 
time reflecting financial market evolution. Figured] (right) demonstrates term 
structure of several IR curves: dependence of IR on maturity for fixed dates. In 
December 21 an inversion of the curve, when very short interest rates (weekly) 
are higher than monthly interest rates was observed. 

Despite of some coherencies in temporal behavior between different maturi- 
ties, the relationship between short (1 month) and long (10 years) maturities 
are nontrivial which can be demonstrated by analysing correlation matrix. 
Sometimes multi-valued relationships can be detected. 



In (IDiebold and Canlinl . 120061 ) monthly IR were modelled using parametric 
Nelson-Siegel model based on 3 factors corresponding to long-term, short-term 
and medium term IR behaviour. These parameters can be interpreted in terms 
of level, slope (maturity 10years -maturity 3mont/l J and curvature (2maturity 2j/ears - 
maturity 3morit/ls — maturity 10year J. First, these parameters were modelled as 
a time series using historical data and then they were applied to make IR 
forecasting. Such approach seems to be quite interesting and transparent both 
from scientific and practical point of views. The results obtained using linear 
time series models have demonstrated efficiency of this methodology. Possible 
extensions to nonlinear models were not considered and are a topic of ongoing 
research. 

Let us consider interest rates mapping in a two-dimensional feature space 
described by maturity (X-axes, in months) and time (Y-axis, in days). In this 
presentation along X-axis one can observe IR curves for fixed date (when Y 
is fixed) and along Y-axis one can observe temporal behaviour of IR for fixed 
maturities. Distance between points in this space is rather synthetic and should 
be taken into account both during model development and interpretation of 
the results. 

Geostatistical approach (family o f kriging models) is based on em pirical analy- 



sis and modelling of variograms (IKanevski and Maignanl . I2004J ) . Variography 
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was efficiently used to quantify the quality of machine learning algorithms 
modelling by estimating the spatial structure of the MLA residuals: good 
models have to demonstrate pure nugget effects (no s patial correlations) with 



a var iance fluctuating around a raw data noise level (IKanevski and Maignan 



20041 ) 



Let us consider interest rate mapping procedure. The value at each prediction 
(unsampled) point Z(x, y) can, in general, be assessed in two ways: 1) model 
Z\ is a weighted sum of the measured/observed neighbouring data Zi or 2) 
model Z 2 is a weighted sum of kernel functions Kj\ 

n 

Zi{x,y) = E Wi{x,y)Zi(xi,yi) 
i=i 

m 

Z 2 (x, y) = E ctjKj(x, y; x h y { ) + b 

3=1 



where n and m are the number of points or kernels used for the prediction, Wi 
and CKj are corresponding weight coefficients. 

In case of inverse distance weighting mapping the weights are inversely pro- 
portional to the power of distance between observations and prediction point; 
in case of kriging models they are a solution of a system of linear equations 
derived from the principle of best linear unbiased predictor; and in case of Sup- 
port Vector Regression they are a solu tion of quadratic optimization problem 



following Stat i stical Learning Theory (lHaykinl . Il999l ; iKanevski and Maignan 



2004 : IVapnikl . 119991 ). In case of MLP kernels are replaced by a combination 
of nonlinear transfer function, e.g. sigmoid. IDW, kriging and MLP are well 
known and widely applied models for spatial data analysis and mapping. 

Statistical Learning Theory is a general framework for solving classification, 
regression and probability density estimation problems using finite number of 
empirical data. SVR is a non-parametric regression method, which exploits 
kernel expansion. It attempts at minimizing the empirical risk (the residuals 
on the training data), simultaneously keeping low the complexity of the model. 
By doing this, the over-fitting on the training data can be avoided and one 
may expect promising predictive abilities. Unlike MLP, SVR after fixing few 
hyper-paramet ers, has a unique solution of quadratic optimisation problem 



flVapnikUl999f ). 



All models (IDW, kriging, MLP, SVR) depend on some hyper-parameters 
(number of neighbours, IDW power, variogram model, number of hidden neu- 
rons, regularization and kernel parameters, etc.) which can be tuned using 
different statistical techniques, e.g. data splitting (training-testing- validation), 
cross-validation, jack-knife. 

The result of IDW mapping with a power equal to two is presented in Figure [21 
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Fig. 2. Interest rates mapping using inverse distance weighting (left) and multilayer 
perceptron model (right). 

This map gives quite consistent reproduction of the evolution of interest rates 
in time. IDW model is fast to tune and easy to apply and useful for the 
visualisation purposes. But it does not take into account spatial structure of 
data and can produce some artifacts. 

Second approach presented in this paper is based on MLP. It is a well known 
nonlinear modelling tool having both advantages and some drawbacks as a 
black-box tool (interpretability of the results, overfitting of data, multiple local 
minima, etc.). Nevertheless, MLP is a flexible and very powerful approach for 
data mining and data modelling. The efficiency and quality of mapping of 
MLP can be controlled using (geo)statistical tools both f or raw data and for 
the residuals f Hay kin . 1999 ; Kanevski and Maignan . 20041 ). More details and a 
generic methodology of MLP a pplication for spatial data anal ysis and mapping 
can be found, for exampl e, in (IKanevski and Maignanl . 120041 ) and for different 
financial applications in (IMcNelisl . I2005T ) . The MLP map is given in Figure [2j 



Following the data modeling traditions we have split original IR data into 
training (80% of data) and testing subsets (20% of data). Testing data set 
was used to control the quality of different MLP and to avoid overfitting. 
Testing data set was used to find optimal number of hidden layers and hidden 
neurons. As an optimal a network with 2 hidden layers each consisting of 25 
hidden neurons was applied. By comparing the results of MLP mapping and 
visualisation of original data it can be noted that MLP was able to detect 
and to model large and medium scale structures using nonlinear smoothing 
approach. Small scale and low level variability were ignored and treated as 
insignificant noise. 

Reconstruction of two different interest rate curves (in-sample prediction) - 
these curves were extracted from training data set and then modelled using 
MLP is given in Figure [3J Both curves are quite well reconstructed, including 
inversion of IRC. 



Developed MLP model was used for the one month ahead IRC prediction. 
An example of the predicted IRC along with real/observed data is given in 
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Fig. 3. Left: CHF interest rate curves reconstruction using MLP. Right: CHF interest 
rate curve prediction using MLP. 




Fig. 4. Left: Interest rates mapping using Support Vector Regression. Right: Testing 
of Support Vector Regression model. 

Figure [3] The result is quite promising and more simulations under different 
market conditions and time horizons are in progress to validate the predictabil- 
ity of this mapping approach. 

Finally, Support Vector Regression was applied for CHF IR mapping. The 
same as for MLP methodology was applied to tune SVR hyper-parameters: 
epsilon-insensitive zone of robust loss-function, regular isat ion parameter C, 
and the shape of a Gaussian kernel used. It should be noted that SVR approach 
is quite computationally intensive and tuning of its parameters is not a trivial 
task. The IR map produced by SVR is presented in Figure HI In comparison 
with MLP SVR is less smoothing data and well reproduces spatio-temporal 
pattern. 



In order to validate the quality of SVR the testing data predictions (also not 
used to develop a model) are compared with the observed ones (Figure HI 
right). The results obtained are quite promising. An important improvement 
for all models proposed can be achieved when applying the same analysis 
(development and retraining of models) within a moving window which can 
be related to the market conditions. This can partly help to avoid also the 
problem of spatial nonstationarit y. Another approach can be based on hy- 
brid MLA+geostatistics models (IKanevski and Maignanl . 120041 ) already suc- 
cessfully applied for spatial environmental data. All spatial data analysi s was 
carried out using Geostat Office software (IKanevski and Maignanl . 120041 ) . 
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3 Conclusions 



The paper presents the results on mapping of CHF interest rates. An impor- 
tant attention was paid to the visualisation of IR curves in a two-dimensional 
maturity-date feature space. Tools and models from spatial statistics and ma- 
chine learning were applied to produce the maps. Developed interest rates 
maps can be easily interpreted and give good summary view on IRC evo- 
lution. Some promising results on forecasting were obtained using artificial 
neural networks. An important improvement can be achieved by using hybrid 
models based on geostatistics and machine learning and incorporating time 
series tools into modelling and forecasting procedures. In the future IR map- 
ping methodology will incorporate economical/financial hypotheses of market 
behaviour in order to elaborate "what- if " scenarios for the financial risk man- 
agement. 
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